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FIG. 7 



1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

2 01 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

3 01 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

3 51 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

4 01 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

5 01 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

7 01 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

8 01 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
8 51 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

10 01 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAG CTGGAA 

12 01 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 

13 01 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

13 51 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 

14 01 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGT ATT AC C A ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAAC CTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TT C AC T C AG C 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

2 01 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

3 01 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

4 01 T CAT CAAGCG ATGTTTCAGA TAGACAAAAA CTT CAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
6 01 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
8 01 AT ATGAAAT C AAGAAC CAGA ATATTTTATT GCACTCATTT TAATAGAAAT 

8 51 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 

9 01 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 

10 01 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGT CGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGT CACACAA TTTATTAATG AATTTTTCTA 

12 01 CAATATACTC CC CAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

12 51 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

13 01 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

13 51 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

14 01 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
14 51 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
16 01 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

16 51 AC CGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 

17 01 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGAT CC 

18 01 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 

18 51 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

19 01 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2 0 01 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2 051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AG C AAAGCAA 

22 01 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 
2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

23 01 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
23 51 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
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24 01 CCCTGAAAAT CCAAATGTTA 

24 51 TTTTGATTAC AACTCAAGAG 

25 01 ATAAACGTAA GTCGTGAAAA 
2551 GACTAGTTTT CCATTAAGTC 

26 01 GTGTTGAGGA GCAAAATATT 

26 51 TCAATTGATA TGAAAACTCT 

27 01 AGAAGGAATT CTGTGTACAC 
2751 CAATGTGGCT CAAGAAGAAA 
2801 CATTATTTTA GAAAGACGAT 

2 851 CAACAAGTTA TTTATATCAG 
2901 AATACAAGGA CCACTTTAAG 
2951 TTAGAGGTAT CTAAAATTAT 

3 001 CCTTGTGTGC AATATTAAGG 
3 051 ACTTTTTCCT TAGCACACTG 
3101 AAGTACATTT TCAACAGAGT 
3151 GCTAAAAAGT GACCAATGTC 
3 201 CTATTCTAAC TTATTTTGGA 
3 251 TGGGGTTTTG GGGTTTTGGG 



ATCTTCTAAT GAGACTTACA GATGACTATC 
AATAATGCAG TATTGTTTAT TGAGAAACTT 
TGGATTTAAA TTCAATATGA AGAAACTACA 
CAAGCAAATT TGCAAAATAC GGAATGGATA 
GTTCAAGATT ACTGCGATTG GATTGGCATC 
TGCTTTAATG CCAAATATTA ACTTGAGAAT 
TCAATCTAAA CATG CAAAC A AAG AAAG CAT 
CTAAAGTCGT TTTTAATGAA TAACATTACC 
TACAACCGAA GACTTTGCGA ATAAAACTCT 
GCGGTTACAA ATACATGCAA TGAGCCAAAG 
AAGAACTTAG CTATGAGCAG TATGAT CGAC 
ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
ATACAATTTT TGGAGAGGAG CATTAT C C AG 
AAGCACTTTA TTGAAATATT CAG CACAAAA 
TTGCATGATC CTCAAGGCAA AAG AAG C AAA 
AATCTCTAAT TCAATATGAT GCATAGTCGA 
AAGTTAATTT TCAATTTTTG TCTTATATAC 
GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

2 01 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNN I S AMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

3 01 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 

3 51 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KN FQKKVKKY VELNKHELIH 

4 01 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 

4 51 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 

5 01 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS WREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KY I FNRVCM I LKAKEAKLKS DQCQSLIQYD A 
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C CC C AAAAC C C C AAAAC CCCAAAAC CCCTATAAAAAAAGAAAAAATTGAGGTAGTTT AG A ■ 

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

PQNPKTPKPL*KKKKLR*FR 
PKTPKPQNPYKKRKN* GSLE - 
PKPQN PKTP I KKEKI EVV* K - 

AATAAAATATTATTC C CGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

NKILFPHKWRWILIWMI*KI 
I KYYSRTNGDGY* FG* YRKF 
*NI I PAQMEMDIDLDDI ENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

YFL IHSTS IAALVVTRKDAK 
TS*YIQQV*QLL* *QERMQN- 
LPNTFNKYS SSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

H*NLARNRLH*LFQSCKNN* 
IEIWLEIAFIDYSKVAKTIR 
LKSGSKS PSLTI PKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

SSTSRMQIFITILS*EN*F* 
VLLLGCKSL*RFFLEKI SFK- 
FYFSDANLYNDS F LRKLVL 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

KAESKE* KLKHY*CLNKIR* 
KRRAKSRN*NITNV* IKSGN- 
SGEQRVE I ETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

*GLFYFLDHFLRSIMEKIT* 
EDYS I F* ITS*GALWRKLLN- 
RI ILFFRSLLKEHYGENYLI- 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWI ISLANNDEYIKF 

c LKGKQFGLFP* PTMMS I L N S - 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRI S IHQTYQRQTRY 

b I*E*VKGSRYIRLTKDKLAI- 

C YENESKDLDTSDLPKTNSL * - 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 
541 + + + + + + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV* *SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTI R 

c NARKSL I I EQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 + + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES*DN 

b MGFITIVLGIDGELPSLETI 

C WVLLQLF*VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a *KSCLQLKESQF*KF*CVCH 

b EKAVYN*RNRSSESSDVYAI 

C KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN* SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

c FCELI SNILSQFNG* L* KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKS FGTNAH 

b QINHASLMEYTLNPLGQMHT 

C K*TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD*FS 

b EFILDS * S IDTQNALETDLA- 

C NLYWILKA* IHRML*RLI *L- 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLI SLKEA 

b YNRLPVLITLAHLLiYL* KKQ- 

c TTDYLF* LLLL I SYI FKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEI SKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

C RNEKKTKERDFKIC* FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS * F 

b ELTTR I LATKKKKSYHNPDS 

c N*QQEY*QRKRRRAI TILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS 

C KDFKNSR*ERYIH*NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVI FFYLNNI F* LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b * KVSNKRSARLR* LSLFTFI 

C KKYQ I REALD* GNLAYSHS * - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

C IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a * CYED * I FRVKKWSRNLNQK 

b SAMRTKFLESRNGAE I LI KK- 

c VL*GLNF*SQEMEPKS * SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL* IFR* *VLP 

b NCVDIAKESNSKSFVNKYYQ- 

C IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTG7U\GAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILIDCRD*RGNCTEDH*RNK 

b S*LIEEIDEATAQKIIKEIK- 

C LD*LKRLTRQLHRRSLKK*S- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + -- + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN* ITNIEI SDLQ 

b *LLLIRE*TKLLI * R S A I F N - 

C NFY* LENKLNY*YRDQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KLN*S*TIKNTNLGQN 

b *RNKS*TKVRQ*KIQTLVKI- 

C DEIKA ELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ* IK* 

b LRKEKKTS *QKKK*GNK*NE 

c *GRKRRPVSKRKNKAINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFI FFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF- 

c TEVKK* KI YFFQ* FI E KRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
C GVLGFW 
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2 EVDVQNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|::: | :.::|: :| |.::| ::. . | |..|.|.| 

19 ELELEMQENQNDIQVRVK. . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVTARKEVF . STGLMIELIDKCLVELL 100 

: .|.| ..|.. .|: ..:| | ...::|.: 
63 DERRYIITKALL . . . . EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 107 

101 SSSDVSDRQKLQCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 15 0 

..: .|: |:.|| : 

108 CWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTKYLIFQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 2 00 

| :|| | .: ..|.... ::: :: 

145 FDATEFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . .PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

.::|| .::|..: . |... ::|. .| ..|:. |.: 
182 SEFNE YQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

248 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 2 97 

|.: ...| :. | .: : s|:..: | ||. |:.|| 

221 . RKENLLTKLQAIKESEDKSKRETG DIMNVEDAI KALKPAVMKKI 264 

298 AYMLE KVKDFNFNYYLTKS CPL PENWRERKQKI ENL INKTREE KS KYYEE 347 
| . . :| : . | |.| |.| :.: 

2 65 AKRQNAMK KHMKAPKI PNSTLESKYLTFKD 2 94 

34 8 LFSYTTDNKCVTQFINEFFYNILPKDFLTGRNRKNFQKKVKKYVELNKHE 3 97 

h I .|-M.|.: : s| -I 

295 LIKFCHISEP KERVYKILGKKYPKTEEEYKAAFGDSASAPFN . PE 33 8 

3 98 LIHKNLLLEKINTREISWMQVETSAKHFYYFDHENIYVLWKLLRWIFEDL 44 7 

| |.. :| .|:| .: . :..|. : : .| ... :|| : : 
339 LAGKRMKIEI SKTWENELSAKGNTAEVWDNLI SSNQLPYMAMLRNLSN . . 386 

44 8 WSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 4 97 

hi ::.| 

3 87 ILKAGVSD 3 94 

4 98 KEVEEWKKSLGFAPGKLRLI PKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 54 7 

3 95 TTHS 3 98 

54 8 KLLNSHLMLKTLKNRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 5 97 

| :|. |.|:..|: 

399 IVINK I CEPKAVENSKM 415 

5 98 FFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKNNIVID 64 7 

| :. :.|| | :.. :|..| .: :.:|| |: |:: 

416 F . . PLQFFSAIEAVN . EAVTKGFKAKK . . . RENMNLKGQIEAVKE . . WE 457 

648 SKNFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 6 97 

..: ||:| .||..: ..:..| |. .| :. 

458 KTDEEKKDM ELEQTEEGEFVKVNEGIGKQYINSIELAIK 4 96 

698 AKQRN YFKKDNLLQ P VIN I CQ YNY I NFNGKF YKQTKG I PQGLCVS S I LS S 747 
. . . . : |. ::. .| :|: 

4 97 I AVNKNLDE I KGHTAI FSDVSGSMSTSMSGGAKKYGS VRTCLECALVLGL 54 6 

748 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFI 7 97 

|.||: ::...| : 
547 MVKQRCEKSSFYIFSSPSSQCNKCYLEVDL 576 

FIG. 13A 
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798 EKLINVSRENGFKFNMKK . LQTSFPLSPSKFAKYGMDSVEEQNI VQDYCD 846 

-|.| ||.. .|:.: --«. ::::| I 

577 PGDELRPSMQKLLQEKGKLGGG . . TDFPYECIDEWTKNKTHVD 617 

847 WIGIS I DMKTLALM PN INLR I EG I LCTLNLNMQTKKASMWLKKKLKS FLM 8 96 

l-l II-. - .:|hl -|: - ||-|. : 

618 NIVILSDMMIAEGYSDINVRGSSIVNSI KKYKDEVN 653 

897 NNITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD . HFKKNLAM 945 

||. : .:. |::: |::. :.|:.: :: | ::| 

654 PNIKIF. . .AVDLEGYG KCLNLGDEFNENNYIKIFGM 687 

946 SSMIDLEVSKIIYSVTRAFFKYLVCNIKDTIFGEEHYPDFFLSTLKHFIE 995 

I • I :|:: ... :. r::| 
688 SDSI LKFISAKQGGA NMVE 7 06 

996 I FSTKKYI FNRVC 1008 

707 VI . . KNFALQKIG 717 



FIG. 13B 



132 LSTQKQYFFQDEWNQVRAMIGNEL . FRHLYTKYL I FQRTSE . . GTLVQFC 178 

:| ..| ..-.Mill = I I I • 

1 MSRRNQ KKPQAP I GNETNLDFVLQNLE VYKSQ I EH YKTQQQQ I 4 3 

17 9 GNNVFDHLKVNDKFDKKQKGGAADMNE PRCCSTCKYNVKNEKDHFLNNIN 22 8 
:: :. ||..:. :. ..|...| :| | .|..:.:|..:| 

44 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

2 2 9 VPNWNNMKSRTRI FYCTHFNRNNQFFKKHEFVSNKNNI SAMDRAQT I FTN 278 

|::|| |:.| : . .| 

85 QIKQQVQLIKK. . . VGSKVEKDLNLNEDENKKN 114 

27 9 IFRFNRIRKKLKDKVIEKIAYMLEKVKDFNFNYYLTKSCPLPENWRERKQ 328 

...|- • : | | . . | • - - : : | | - 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDY 164 

329 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE . FFYNI LPKDFLTG 377 

..|.::: .:::| .:|. : | : | : ||::. 

165 DTEKWFE I SHDQK NYVS I YANQKTS YCWWLKDYFNK 2 00 

3 7 8 RNRKNFQKKVKKYVELNKHEL IHKNLLLEKINTRE I SWMQVETS AKHFYY 427 

201 NNYDHLNVSINRLE. . TEAEFYAFDDFSQTIKLTNNSYQTVNID 242 

428 FDHENIYVLWKLLRWI . . FEDLWSLIRCFFYVTEQQKSYSKTYYYRKNI 475 

.: :| Ill" :| : s ||: : | -| -«.|. . 

243 VNFDNNLCILALLRFLLSLERFNILNIRSSY. . TRNQYNFEKIGELLETI 290 

476 WDVIMKMSIADLKKETLAEVQEKEVEEWKKSLGFAPGKLRLIPKKTTFRP 525 

:.|:. . .:|: .:: ... . |. |.. 

2 91 FAWFSHR HLQGIHLQVPCEAFQYLVNSSSQI SVKDSQLQ 33 0 

52 6 IMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 5 75 

: |...: | |.|: . |:|.. .|. :. . . || |. 

331 VYSFSTDLKLVD. . TNKVQDYFKFLQEFPRLTHVSQQAI PVSATNAVENL 378 

FIG. 14A 
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576 DDVMKKYE E FVC KV7 KQVGQ P KL F FATMDIEKCYDS . . VNREK 615 

: .:|| :. | |. | :. :.:| :.::| 

379 NVLLKKVKH ANLNLVSIPTQFNFDFYFVNLQHLKLEFGLEPNILTKQK 426 

516 LSTFL KTTKLLSSDFWIMTAQILKRKNNI . . VIDSKNFRKKEMK 657 

|...| h hi h- II Ih : ... II-... ■ 

42 7 LENLLLS IKQSKNLKFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQ 4 76 

558 DYFRQKFQKIALEGGQYPTLFSVLEN . . EQNDLNAKKTL I VE AKQRN Y F K 7 05 

. . • h II I : h • - • -hi I 

4 77 EETPETKDETPSESTSGMKFFDHLSELTELEDFSVN. . . . LQATQEIY . . 52 0 

706 KDNLLQPVINICQYNYINFNGKFYKQTKGI PQGLCVSS I LSSFYYATLEE 755 

II I -I |.||: 
521 .DSLHKLLIRSTNLKKFKLSYKYEMEKSKMDTFIDLKNI YETLNN 564 

756 SSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFIEKLINVSR 3 05 

h I I -Ih I Ih Ih 
565 LKRCSVNISNPHGNISYELTN KDSTFYKFKLTLNQE 500 

806 ENGF KFNMKKLQTS F PL S P S KFAKYGMD S VE EQN I VQD YCDW I G I S I DMK 855 

|:.:| |..|.:. | ||.: .|:|. : ::..|..|: : ::. 
601 LQHAKYTFK. . QNEFQFNNVKSAKIESSSLESLEDIDSLCKSIASCKNLQ 648 

856 TLALMPNINLRIEGILCTLNLNMQT. . KKASMWLKK . . KLKSFLMNNITH 901 

| ::|s. |:.... |.. :::|. .||.s .|. 
649 NVNI IASLLYPNNIQKNPFNKPNLLFFKQFEQLKNLENVSINC 691 

902 YFRKTI . . . TTEDFANKTLNKLFISGGYKYMQCAKEYKDHFKKNLAMSSM 94 8 

. | | | || | .:|: :|.. ||. 

692 I LDQH I LNS I SE FLE KNKKI KAF I LKRY YLLQ Y YLD YTKLFKTLQQL PEL 741 

949 IDLEVSKIIYSVT RAFFKYLVCNIKDT . . IFGEEHY 982 

... : | :||:. |. ||:. .:. .: 

742 NQVYINQQLEELTVSEVHKQWENHKQKAFYEPLCEFIKESSQTLQLIDF 791 

983 PDFFLS TLKHF I E I FSTKKY IFNRVCMILKAKEAKLKSDQCQSLIQ 1028 

.: :| .| | :|..|| . |: ... | ..::.|.|:. 

792 DQNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLK 84 0 

FIG. 14B 



4 DIDLDDIENLLPNTFNKYSSSCSDKKGCKTLKSGSKSPSLTIPK 4 7 

::. ..||. | |.. :||-|.... :|| -h 

617 NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNP 666 

48 LQKQLEFYFSDANLYNDSFLRKLVLKSGEQRVE . . . . IETLLM 86 

: ||:| . . .|: :::| :|.| .: :| |..::: 
6 67 FNKPNLLFFKQFEQLKNLENVS INC ILDQH I LNS I S EFLEKNKKI KAF I L 716 

F/G. 15 
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1 MEMD I DLDD I ENL LPNTFNKYSSSCSDKKGCKTLKSGSKSPS . . . 42 

|: |. . .|| . . . | . . | : | | . . . : . . | | | . . : 

4 91 I ELAI KI AVNKNLDE I KGHTAI FSDVSGSMSTSMSGGAKKYGS VRTCLEC 54 0 

43 . LTIPKLQKQ LEFYFSDANLYNDSFLRKLVLKSGEQRVEIETLL 85 

|.: : || : :: |... :|.::| .: |.::| |..::.|| 

541 AL VLGLMVKQRC E KS S FY I FSSPSSQCNKCYL . EVDLPGDELRPSMQKLL 58 9 

FIG. 16 



telomerase p43 
human La 
Xenopus LaA 
Drosophila La 
S . c . Lhplp 



LQK QL E F YF SDA N LYN D S FL RKLVLKSGEQR V E I ETLLM 
ICHQUEYYFGDFNLPRDKFL KEQI . KLDEGWV PLEIMIK 
ICE2IEYYFGDHNLPRDKFLKQQI . LLDDGWV PLETMIK 
ILR2VEYYF GDA N LNR D K FL REQ IGKNEDGW V PLS VL VT 
CLKQVEF YF SEFN FPYDRFL RTTAEK . NDGWV PISTIAT 
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181 
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361 
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601 
661 
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aactcattta 
tagatttaat 
gccaaaagcc 
tatataagtt 
atgtttgttg 
taaagcactt 
catccgtaat 
ccacaagaat 
tgacttactg 
caaaaatttg 
taagtgttta 
taagtattgc 
caagtaaaag 
ggcaataaag 
agatgcaatc 
catgaagaaa 
gaccttcaag 
gatccttggt 
tgcatctgca 
aacatgggaa 
ttcaagcaat 
cggtgtttca 
tgagaactcc 
agttactaag 
agcagtaaag 
aaccgaagaa 
cattgaactt 
tgcaatcttc 
gtatggttcc 
acgttgtgaa 
ttacttagaa 
agagaaagga 
aaagaataaa 
atattcagat 
tgaagtaaat 
taatctaggt 
aatcttaaag 
ctttgccctt 
ctcaccccac 
atttaagtta 
aaagaacaaa 



attactaatt 
ttagaaagta 
gaaaaattgt 
agggttaaga 
taggaaggta 
cttgaggtgg 
gaactttaca 
actcaaccat 
gaagtctgtg 
tatcttgata 
caaagatgcg 
actgaatcct 
tgggattaaa 
gaatctgaag 
aaggctttaa 
cacatgaagg 
gatctcatta 
aaaaaatacc 
cccttcaatc 
aatgaactca 
taactcccat 
gatactacac 
aagatgttcc 
ggattcaagg 
gaagttgttg 
ggagaatttg 
gcaatcaaga 
tctgatgttt 
gttcgtactt 
aagtcctcat 
gttgatctcc 
aaacttggtg 
actcacgtag 
atcaatgtta 
cctaacatta 
gatgagttca 
ttcatttcag 
caaaaaatag 
ttttttgttt 
cttacatagt 
aaagattaaa 



taatcaacaa 
tcaattgaaa 

ggtgggaact 

ttgacgatcc 
gttactacta 
ctgagtctga 
tcagaactac 
tcatcgaaaa 
aatttgcata 
ggatactttc 
tcagaagcaa 
aacgtaagaa 
ctaagaagaa 
ataagtccaa 
aaccagcagt 
cacctaaaat 
agttctgcca 
ctaagaccga 
ctgaattggc 
gtgcaaaagg 
atatggccat 
actctattgt 
ctcttcaatt 
ccaagaagag 
aaaaaaccga 
ttaaagtcaa 
tagcagttaa 
ctggttctat 
gtctcgagtg 
tctacatctt 
ctggagacga 
gtggtactga 
acaatatcgt 
gaggcagttc 
aaatctttgc 
atgaaaacaa 
ccaagcaagg 
gacaaaagtg 
tattgcatag 
ttatgtatcg 
a 



gattgataaa 
aatggaaatt 
tgaattagag 
taagcaatat 
agataaagat 
tcctgagttc 
cactaactac 
gtacttcaac 
ggttctctat 
ataagatatt 
gttttctgaa 
aacaatgttc 
gagaaaagag 
gagagaaact 
tatgaagaaa 
tcctaactct 
tatttctgag 
agaggaatac 
tggaaagcgt 
caacactgct 
gttacgtaac 
gatcaacaag 
ctttagtgcc 
agaaaatatg 
tgaagagaag 
cgaaggaatt 
caagaattta 
gagtacctca 
tgcattagtc 
cagttcacct 
actccgtcct 
tttcccctat 
tattttgtct 
cattgttaac 
agttgactta 
ctacatcaag 
aggagcaaat 
agtttcttga 
ccattatgaa 
cagtctatta 



aagcagtaaa 
gaaaacaact 
atgcaagaaa 
ctcgtgaacg 
gaaagaagat 
atctgctagt 
attgtagcat 
aaagcagtac 
atttttgatg 
cgtaaggaac 
ttcaacgaat 
cgttacctct 
aatctcttaa 
ggagacataa 
atagccaaga 
accttggaat 
cctaaagaaa 
aaagcagcct 
atgaagattg 
gaggtttggg 
ttgtctaaca 
atttgtgagc 
attgaagctg 
aatcttaaag 
aaagatatgg 
ggcaagcaat 
gatgaaatca 
atgtcaggtg 
cttggtttga 
agttctcaat 
tctatgtaaa 
gagtgcattg 
gatatgatga 
agcatcaaaa 
gaaggttacg 
atattcggta 
atggtcgaag 
gattcttcta 
atttaaatta 
gcctattcaa 



taaaacccaa 
aagcacaata 
accaaaatga 
tcactgcagc 
atatcatcac 
tggcagtcta 
tttgtgttgt 
ttttgcctaa 
caactgaatt 
tcactttccg 
actaacttgg 
cagttaccaa 
ccaaacttta 
tgaacgttga 
gatagaatgc 
caaagtactt 
gagtctataa 
ttggtgattc 
aaatctctaa 
ataatttaat 
tcttaaaagc 
ccaaggccgt 
ttaatgaagc 
gtcaaatcga 
agttggagta 
acattaactc 
aaggacacac 
gagccaagaa 
tggtaaaata 
gcaataagtg 
aacttttgca 
atgaatggac 
ttgcagaagg 
agtacaagga 
gaaagtgcct 
tgagcgattc 
ttatcaaaaa 
taacaaaaat 
ttatctattt 
atgattctgc 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCWHKNTQPFIEKYFNKAVXiLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRS KF S E FNE YQLGKYCTE S QRKKTMFR YL S VTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKI PNSTLESKYLTFKDL I KFCH I SEPKERV 
YKI LGKKYPKTEEE YKAAFGDSASAPFNPELAGKRMKI E I S KTW 
ENE L S AKGNTAE VWDNL ISSNQL P YMAMLRNL SN I LKAGVS DTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNL KGQ I EAVKEWE KTDE EKKDMELEQTEEGE FVKVNEG I G 
KQYINS IELAI KI AVNKNLDE I KGHTAI FSDVSGSMSTSMSGGA 
KKYGS VRTCLECALVLGLMVKQRCEKS S FY I FS S PS SQCNKC YL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNY I KI FGMSDS I LKFI S AKQGGANM 
VEVIKNFALQKIGQK 

FIG. 20 

MSRRNQKKPQAP I GNETNLDFVLQNLEVYKSQ I EHYKTQQQQ I K 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQL I KKVGS KVEKDLNLNEDENKKNGLSEQQVKEEQLRT I TEE 
QVKYQNLVFNMD YQLDLNESGGHRRHRRETD YDTEKWFE ISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNF YT YVAQETSRKQ I LKQATT I KNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKI KAF I LKRYYLLQYYLD YTKLFKTLQQLPELNQVY INQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
-ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDWNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYTV I QFNGQ F FTQ I VGNRCNE PHL P P KWVQRS S S S S ATAAQ I K 
QLTE PVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
I FPTNLVKI PQRLKVR INLTLQKLLKRHKRLNYVS I LNS I C PPL 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAI CF I SWLFRQL I PKI IQTFFYCTE I SSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKKSNNEFRI IAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKR PTSFTKIYS PTQ I ADR I KE F KQRLLKKFNNVL PELYFMK 
FDVKSCYDS I PRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNWNASRVPKPYELYIDNVRTVHLSNQDVINWEMEIFKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
S KG I FRS L I AL FNTR I S YKT I DTNLNS TNT VLMQ I DHWKN I S E 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
HIVN 

FIG. 23 
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1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 

61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 

121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 

181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 

241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 

3 01 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 

3 61 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 

421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 

481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 

541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 

601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 

661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 

721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 

781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 

841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 

901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 

961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 

1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 

1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 

1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 

1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 

1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 

1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 

1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 

1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 

1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 

1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 

1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 

1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 

1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 

1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 

1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 

1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 

1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 

2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 

2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 

2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 

2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 

2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 

2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 

2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 

2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 

2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 

2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 

2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 

2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 

2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 

2821 aaaaaatcg 



FIG. 21 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNWLLMRLT 

FIG. 24 



ATTTATACT CATGAAAAT C TTATT CGAGTT C ATT CAAGACAAGCTTGACATTGAT CTAC A 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATG AGGATTATAC C AAAAAAAAGTAATAATGAGTT CAGGATTATTGC CAT C C CATG C AG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TG CGAAAGC CAATAGAGAC AAAATTTTAG CCGTAAGCTC C CAAT C AGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIG. 26 
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AKFLHWLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFI PKPDGLRP I VNMDYWGARTFRREK 
RAERLTSRVKALFSVLNYERA 



FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 



FIG. 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
T I ETS I TQNKSARKE VS WNS I S I SRFSI F YRS S YKKFKQDL YFNLHS I CDRNTVHMWLQW I FPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNWMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGliNNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLAS FAQVFIDITHNSKFNSCCN 
IYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 



FIG. 29 
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S-l: FFY VTE TTF QKN RLF FYR KSV WSK 

S-2: RQH LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 



A-l: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 

FIG. 32 



Poly 4 

t t c 

ta a g c c teg 
5 1 - cag acc aaa gga att cca taa gg 
QTKGIPQG 

4 (B' ) 



5(c») 

D D Y L L I T 

3'- ctg ctg atg gag gag tag tgg -5 1 

a a aaaaa a a 

t t t t 

c c 
Poly 1 



FIG. 34 
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Vector 



Genomic DNA Insert 



Vector 



A5 



B2' 



5825 bp Sequenced HE 
2 kb Hind III Fragment 



1 kb 



FIG. 33A 



tezr 



Introns 12 3 



RT Motifs 12 3(A) 4{B') 5(C}6(C) 

m ' 



56 7 8 9 10 11 12 131415 



Hind III Xcal Xca I 



Hind III 



4-1 cDNA 
2-3 & 5-20 cDNA 




Original PCR 
3' RT-PCR 




5' RT-PCR W/M2-B 14 
5' RT-PCR W/M2-B15 Band A 
5' RT-PCR W/M2-B15 Band B 
5' RT-PCR W/M2-B16 Band C 



500 bp 



FIG. 33B 
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FIG. 39 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1 .Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 43 
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GCCAAGTTCCTGCACTGGCTG 
10 

arg ser phe phe tyr val 
AGG TCT TTC TTT TAT GTC 

30 

leu phe phe tyr arg lys 
CTC TTT TTC TAC CGG AAG 

40 

gly ile arg gin his leu 
GGA ATC AGA CAG CAC TTG 

60 

glu ala glu val arg gin 
GAA GCA GAG GTC AGG CAG 

70 

thr ser arg leu arg phe 
ACG TCC AGA CTC CGC TTC 

90 

ile val asn met asp tyr 
ATT GTG AAC ATG GAC TAC 

100 

glu lys ala glu arg 

GAA AAG ARG GCC GAG CGT 

120 

ser val leu asn tyr glu 
AGC GTG CTC AAC TAC GAG 

130 

ala ser val leu gly leu 
GCC TCT GTG CTG GGC CTG 

150 

phe val leu arg val arg 
TTC GTG CTG CGT GTG CGG 

160 

phe val lys val asp val 
TTT GTC AAG GTG GAT GTG 

180 

asp arg leu thr glu val 
GAC AGG CTC ACG GAG GTC 

190 

thr tyr cys val arg arg 
ACG TAC TGC GTG CGT CGG 



1 

met ser val 
ATG AGT GTG 



thr glu thr 
ACG GAG ACC 



ser val trp 
AGT GTC TGG 



lys arg val 
AAG AGG GTG 



his arg glu 
CAT CGG GAA 



ile pro lys 
ATC CCC AAG 



val val gly 
GTC GTG GGA 



leu thr ser 
CTC ACC TCG 



arg ala arg 
CGG GCG CGG 



asp asp ile 
GAC GAT ATC 



ala gin asp 
GCC CAG GAC 



thr gly ala 
ACG GGC GCG 



ile ala ser 
ATC GCC AGC 



tyr ala val 
TAT GCC GTG 



tyr val val 
TAC GTC GTC 

20 

thr phe gin 
ACG TTT CAA 



ser lys leu 
AGC AAG TTG 

50 

gin leu arg 
CAG CTG CGG 



ala arg pro 
GCC AGG CCC 

80 

pro asp gly 
CCT GAC GGG 



ala arg thr 
GCC AGA ACG 

110 
arg val lys 
AGG GTG AAG 



arg pro gly 
CGC CCC GGC 

140 

his arg ala 
CAC AGG GCC 



pro pro pro 
CCG CCG CCT 

170 

tyr asp thr 
TAC GAC ACC 



ile ile lys 
ATC ATC AAA 

200 
val gin lys 
GTC CAG AAG 



glu leu leu 
GAG CTG CTC 



lys asn arg 
AAG AAC AGG 



gin ser ile 
CAA AGC ATT 



glu leu ser 
GAG CTG TCG 



ala leu leu 
GCC CTG CTG 



leu arg pro 
CTG CGG CCG 



phe arg arg 
TTC CGC AGA 



ala leu phe 
GCA CTG TTC 



leu leu gly 
CTC CTG GGC 



trp arg thr 
TGG CGC ACC 



glu leu tyr 
GAG CTG TAC 



ile pro gin 
ATC CCC CAG 



pro gin asn 
CCC CAG AAC 



ala ala met 
GCC GCC ATG 



FIG. 47A 
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gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
CAG GGG ATC CCG CAG 



leu cys. tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro 
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



leu gin thr val •cys 
CTC CAG ACG GTG TGC 

400 

ala tyr arg phe his 
GCG TAC AGG TTT CAC 



110 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 

390 

thr asn ile tyr lys 
ACC AAC ATC TAC AAG 



ala cys val leu gin 
GCA TGT GTG CTG CAG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 



ile leu leu leu gin 
ATC CTC CTG CTG CAG 

410 

leu pro phe his gin 
CTC CCA TTT CAT CAG 



FIG. 47B 
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gin val trp lys asn 
CAA GTT TGG AAG AAC 

430 

arg leu pro leu leu 
CGG CTC CCT CTG CTA 



val ala gly gly gin 
GTC GCT GGG GGC CAA 

460 

arg ala val ala val 
CGT GCA GTG GCT GTG 



thr pro cys his leu 
ACA CCG TGT CAC CTA 

490 

pro asp ala ala glu 
CCA GAC GCA GCT GAG 



pro gly gly arg ser 
CCT GGA GGC CGC AGC 

520 

his pro gly leu met 
CAT CCT GGA CTG ATG 



thr pro ala ala leu 
ACA CCA GCA GCC CTG 

550 

arg gly gly pro his 
AGG GGC GGC CCA CAC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIG. 47C 



pro his phe ser cys 
CCA CAT TTT TCC TGC 



leu his pro glu ser 
CTC CAT CCT GAA AGC 

450 

gly arg arg arg pro 
GGG CGC CGC CGG CCC 



pro pro ser ile pro 
CCA CCA AGC ATT CCT 

480 

arg ala thr pro gly 
CGT GCC ACT CCT GGG 



ser glu ala pro gly 
TCG GAA GCT CCC GGG 

510 

gin pro gly thr ala 
CAA CCC GGC ACT GCC 



ala thr arg pro gin 
GCC ACC CGC CCA CAG 

540 

ser arg arg ala tyr 
TCA CGC CGG GCT TAT 



pro gly leu his arg 
CCA GGC CTG CAC CGC 



ala ser ser leu thr 
GCG TCA TCT CTG ACA 

440 

gin glu arg arg asp 
CAA GAA CGC AGG GAT 



ser ala leu arg gly 
TCT GCC CTC CGA GGC 

470 

ala gin ala asp ser 
GCT CAA GCT GAC TCG 



val thr gin asp ser 
GTC ACT CAG GAC AGC 

500 

asp asp ala asp cys 
GAC GAC GCT GAC TGC 



leu arg leu gin asp 
CTC AGA CTT CAA GAC 

530 

pro gly arg glu gin 
CCA GGC CGA GAG CAG 



thr ser gin gly gly 
ACG TCC CAG GGA GGG 

560 

trp glu ser glu ala 
TGG GAG TCT GAG GCC 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

2 01 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

2 51 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

3 01 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

3 51 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

4 01 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
4 51 GGGGAG CGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
651 CAACGGGCCT GGAAC CAT AG CGT CAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTG CGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
851 GAC CGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGC CAC 
901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGAC CAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

12 01 GCGNTACTGG CAAATG CGG C CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

13 01 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

13 51 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

14 01 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
14 51 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAG CAT 
1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
16 01 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

16 51 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

17 01 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAG CAAGT 

18 01 TG CAAAG CAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1851 CTGTCGGAAG CAGAGGT C AG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 

19 01 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2 0 01 GCCGAGCGTC TCACCTCGAG GGTGAAGG CA CTGTTCAGCG TGCTCAACTA 
2 051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AG CAT CAT CA 

22 51 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 

23 01 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 

23 51 AGAC CT C C AG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 

24 01 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC C CTGAATGAG 
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 

FIG. 50A 
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2 501 CGTGCGCATC AGGGGCAAGT 

2 551 GCTCCATCCT CTCCACGCTG 

2601 AACAAGCTGT TTGCGGGGAT 

2 651 GGATGATTTC TTGTTGGTGA 

2 7 01 TCAGGACCCT GGTCCGAGGT 

2 751 CGGAAGACAG TGGTGAACTT 

2 801 GGCTTTTGTT CAGATGCCGG 

2 851 TGCTGGATAC CCGGACCCTG 

2 901 CGGACCTCCA T C AG AG C C AG 

2 951 GAGGAACATG CGTCGCAAAC 

3 001 GCCTGTTTCT GGATTTGCAG 
3 0 51 AT CTAC AAGA TCCTCCTGCT 
3101 GCAGCTCCCA TTTCATCAGC 
3151 GCGTCATCTC TGACACGGCC 
3 2 01 AACGCAGGGA TGTCGCTGGG 

32 51 CGAGGCCGTG CAGTGGCTGT 

33 01 GACACCGTGT CACCTACGTG 

33 51 ACGCAGCTGA GTCGGAAGCT 

34 01 CGCAGCCAAC CCGGCACTGC 
34 51 GGCCACCCGC CCACAGCCAG 
3 5 01 GCCGGGCTCT ACGTCCCAGG 
3 551 ACCGCTGGGA GTCTGAGGCC 
36 01 CCGGCTGAAG GCTGAGTGTC 
36 51 GGGCTGAGTG TCCAGCACAC 
3 701 CTCGGCTCCA CCCCAGGGCC 
3 751 CTCCCCACAT AGGAATAGTC 
3 8 01 GCCCTGCCCT CCTTTGCCTT 
3 851 GAAGGACCCT GGGAGCTCTG 
3 901 TACACAGGCG AGGACCCTGC 
3 951 GGGGGGAGGT GCTGTGGGAG 
4001 TTGAAAAAAA AAAAAAAAAA 



CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
CTCTGCAGCC TGTGCTACGG CGACATGGAG 
TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
CACCTCACCT CACCCACGCG AAAACCTTCC 
GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
GAGGTG C AG A GCGACTACTC CAGCTATGCC 
TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
GTGAACAGCC TCCAGACGGT GTGCACCAAC 
GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
TCCCTCTGCT ACTCCATCCT GAAAGC CAAG 
GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
CCACTCCTGG GGTCACTCAG GACAGCCCAG 
CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
CCTCAGACTT CAAGACCATC CTGGACTGAT 
GCCGAGAGCA GACAC CAGC A GCCCTGTCAC 
GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
CATCCCCAGA TTCGCCATTG TTCACCCCTC 
CCACCCCCAC CAT C C AGGTG GAG AC C CTGA 
GGAATTTGGA GTGAC CAAAG GTGTGCCCTG 
ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
TAAAATACTG AATATATGAG TTTTTCAGTT 
AAAAAAAAA 



FIG. SOB 



+ 



+ 



63/78 



GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 



X + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGSPGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

c SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

61 + + + + + + 120 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL 

c ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

a AAGHVRAAPGAPGLAAGAAR 

b PLATFVRRLGPQGWRLVQRG- 

C RWPRS CGAWG PRAGGWC S AG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCGANTGCNTGGTGTGCGTGCCCTGGGANGN 

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

a GPGGFPR?GGP??GVRALG? 

b DPAAFRA?VA?C?VCVPW??- 

C TRRLSARWWP?AWCACPG??- 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

a ?AAPRRPLLPPGVLPE??GG 

b ??PPAAPSFRQVSCL??LVA- 

c G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

301 + - + + + + + 360 

GGCTCACGACGTOmCGACACGCTOGCGCC 

a PSAA?AVR?RRE?RAGLRLR 

b RVL??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQRAQL 

b LLDGARGGPPEAFTTSVRSY- 

c CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

c CPTR* PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR ? ? CAG 

b RVGDDVLVHLLARCA? F V L V - 

C AWATTCWFTCWHAAR ? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 + + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

a GSQLRLP7VRAAAVPARRC? 

b ?PSCAY?VCGPPLYQLGAAT- 

c ? PAAPT ? CAGRRCTSSAL ? L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCC7^ACGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

j 

a SGPAPATR?WTR?RLGSNGP 

b QARP P P H A ? GPE 7VWDPTGL 

c RPGPRHTL ? DP 7ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

661 + + + + + + 720 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

a GTIASGRPGS PWAASPGCEE 

b EP*RQGGRGPPGLPAPGARR- 

C NHSVREAGVPLGCQ PRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

721 + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

a ARGQCQPKSAVAQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

c AGAVPAEVCRC PRGPGVAL P- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

a P*AGADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP- 

C LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

a WTE* PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPIRGPPA 

b SLEGALSGTRHSHPSVGRQH 

C LWRVR S LARAT PTH PWAAS T- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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a PRGPPIHIAATTSWDTPCPP 

b HAGPPSTSRPPRPGTRLVPR- 

C TRAPHPHRGHHVLGHALSPG- 

GTGTACGCCGAGACCAAGCACTTCCTCT^ 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATG^ 

a VYAETKHFLYSSGDK? TA?L 

b CTPRPSTSSTPQAT?TLRPS- 

C VRRDQALPLLLRRQ?HC ? P P - 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTT 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

a LPTQYI *GPA*LAFGRFVET 

b FLLNISEAQPDWRSGGSWR? 

C SYS IYLRPSLTGVREVRGD?- 

OTCTTTCTGGTTCCAGGCCTTGGATGCC^ 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAO 

a ? FLVPGLGCQDSPQVAPPAP 

b SFWFQALDARI PRRLPRLPQ 

C LSGSRPWMPGFPAGCPACPS- 

GCGOTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTC 

1201 + + + + + + 1260 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACG^ 

a A?LANAAPVSGAAWE PRAVP 

b RYWQMRPLFLELLGNHAQCP- 

c ? TGKCGPCFWSCLGTTRSAP- 

CTACGGGGTGTTCCTCAAGACGGACTGCCCGCTC 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

a LRGVPQDALPAASCGHPSSR 

b YGVFLKTHCPLRAAVTPAAG- 

c TGCSSRRTARCELRSPQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACAC^ 

a CLCPGEAPGLCGGPRGGGTQ 

b VCAREKPQGSVAAPEEEEHR- 

C SVPGRS PRALWRPPRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

a TPVAWCSCSASTAAPGRCTA 

b PPSPGAAAPPAQQPLAGVRL 

C PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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a SCGPACAGWCPQASGAPGTT 

b RAGLPAPAGAPRPLGLQAQR- 

C VRACLRRLVPPGLWGSRHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATC 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL- 

c RRFLRNTKKFI SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCT 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

a RCRS *RGR*ACGTALGCAGA 

b AAGADVEDERAGLRLAAQEP- 

C LQE LTWKMSVRDCAWLRRS P - 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGA 

1621 + + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

a QGLAVFRPQSTVCVRRSWPS 

b RGWLCSGRRAPSA*GDPGQV- 

c GVGCVPAAEHRLREE I LAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTC 

a SCTG**VCTSSSCSGLSFMS 

b PALADECVRRRAAQVFLLCH- 

c LHWLMSVYVVELLRS FFYVT- 

CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTC 

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTC 

a RRPRFKRTGSFSTGRVSGAS 

b GDHVSKEQALFLPEECLEQV- 

C ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTT^ 

1801 + + + + + + 1860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

a CKALESDST*RGCSCGSCRK 

b AKHWNQTALEEGAAAGAVGS- 

C Q S I GI RQHLKRVQLRELS EA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH 

C EVRQHREARPALLTSRLiRF I- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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a SPSLTGCGRL* TWTTSWEPE 

b PQA* RAAADCEHGLRRGSQN 

C PKPDGLR P IVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGC^CTGTTCAGCG 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGC 

a RSAEKRGPSVSPRG*RHCSA 

b VPQRKEGRASHLEGEGTVQR- 

c FRREKRAERLTSRVKALFSV- 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

a CSTTSGRGAPASWAPLCWAW 

b AQLRAGAAPRPPGRLCAGPG- 

C LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

a TISTGPGAPSCCVCGPRTRR 

b RYPQGLAHLRAACAGPGPAA- 

c DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGAC^CCATCCCCCAGGAC^ 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

a LSCTLSRWM*RARTTPSPRT 

b *AVLCQGGCDGRVRHHPPGQ 

c ELYFVKVDVTGAYDTI PQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTO 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTT^ 

a GSRRSS PASSNPRTRTACVG 

b AHGGHRQHHQTPEHVLRASV- 

c LTEVIAS I I KPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

a MPWSRRPPMGTSARPSRATS 

b CRGPEGRPWARPQGLQEPRL 

C AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTC^ 

a LP*QTSSRTCDSSWLTCR?T 

b YLDRPPAVHATVRGS PAG?Q 

C TLTDLQPYMRQFVAHLQ7NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGOTCCTC^ 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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a AR*GMPSSSSRAPP*MRPAV 

b PAEGCRRHRAELLPE * GQQW 

C PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCA^ 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTO\ 

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

C LFDVFLRFMCHHAVR I RGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

a PTSSARGSRRAPSSPRCSAA 

b LRPVPGDPAGLHPLHAALQP- 

c YVQCQGI PQGSILSTLLCSL- 

TGTGOTACGGCGACATGGAGAACAAGCTGTTTGCGGG^ 

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

a CATATWRTSCLRGFGGTGCS 

b VLRRHGEQAVCGDSAGRAAP- 

C CYGDMENKLFAG I RRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGG 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGT 

a CVWWMISCW*HLTSPTRKPS 

b AFGG*FLVGDTSPHPRENLP- 

c RLVDDFLLVTPHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGC^ 

2701 + + + + + + 2760 

AGTCCTGGGACCAGGCTCCACAGGGACTCATAC^ 

a SGPWSEVSLSMAAW*TCGRQ 

b QDPGPRCP*VWLRGELAEDS- 

c RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTG^^ 

2761 + + + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAA 

a W*TSL* KTRPWVARLLFRCR 

b GELPCRRRGPGWHGFCSDAG 

C VNFPVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + 4- + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

a PTAYSPGAACCWIPGPWRCR 

b PRPI PLVRPAAGYPDPGGAE 

c HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTG 
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a ATTPAMPGPPSEPVSPSTAA 

b RLLQLCPDLHQSQSHLQPRL 

C DYSSYARTS IRASLTFNRGF- 

TCAAGGCTGGGAGGAAGATGCGTCGCAAACTCTTO 

2941 + + + + + + 3000 

AGTTCCGACCCTCCTTGTACGCAGCGTTTCAC 

a SRLGGTCVANSLGSCG* SVT 

b QGWEEHASQTLWGLAAEVSQ 

C KAGRNMRRKLFGVLR LKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGra 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

a ACFWICR* TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD 

C LFLDLQVNS LQTVCTN I Y K I - 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC 

3061 + + + + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

a SSCCRRTGFTHVCCSSHFIS 

b PPAAGVQVSRMCAAAPISSA- 

c LLLQAYRFHACVLQL PF HQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCT^ 

3121 + + + + + + 3180 

TTCAAACCITCTTGGGGTGTAAAAAGGACGCGC^GTAGAGACTGTGCCGGAGGGAGACGA 

a KFGRTPHFSCASSLTRPPSA 

b SLEEPHI FPARHL*HGLPLL- 

c VWKNPTFFLRVI SDTASLCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTAC^GCGACCCCCGGTTCCCGCGGCGGCCGG 

a TPS * KPRTQGCRWGPRAPPA 

b LHPESQERRDVAGGQGRRRP- 

C SILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

a LCPPRPCSGCATKHSCSS*L 

b SALRGRAVAVPPS I PAQADS 

C LPSEAVQWLCHQAFLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

a DTVSPTCHSWGHSGQPRRS * 

b TPCHLRATPGVTQDS PDAAE 

C HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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a VGSSRGRR* LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA 

C RKL PGTTLTALEAAAN P A L P - 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCAC^ 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

C SDFKTILD*WPPAHSQAESR- 

GAC^CCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACA 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSS PVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

c HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGG^ 

3541 + + + + + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE*VFGRGLHV- 

c RPAPLGV*GLSECLAEACMS- 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 + + + + + + 3660 

GGCCGACTTCCGACTCAC^GGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC- 

c G* RLSVRLRPERVS SQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCAC^^ 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPG P- 

c QHTCRLHF PTGWRSAP PQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

c LFLTRS PASTPHIG IVHPQ I - 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCT^^ 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCS PLALPS FAFH PHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTT^ 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

C DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTC^ 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV- 

c HRRG PCTWMGVPVGQ I GGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTT^ 

3961 + + + + + + 4020 

CGACACCCTCATTTTATGACTTATATACT 

a AVGVKY* IYEFFSFEKKKKK 

b LW E *NTEYMSFSVLKKKKKK- 

c CGSKILNI *VFQF * KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

a K K K 

b K K 

c K K 
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i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser ieu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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leu 
CTG 


gly 
GGA 


cys 
TGC 


glu 

GAA 


arg 
CGG 


ala 
GCC 


trp 
TGG 


asn 
AAC 


his 
CAT 


ser 
AGC 


val 
GTC 


arg 
AGG 


glu 
GAG 


ala 
GCC 


gly 
GGG 


val 
GTC 


pro 
CCC 


leu 
CTG 


gly 
GGC 


leu 
CTG 


pro 
CCA 


ala 
GCC 


pro 
CCG 


220 
gly 
GGT 


ala 
GCG 


arg 
AGG 


arq 
AGG 


arg 
CGC 


gly 
GGG 


gly 
GGC 


ser 
AGT 


ala 
GCC 


ser 
AGC 


230 
arg 
CGA 


ser 
AGT 


leu 
CTG 


pro 
CCG 


leu 
TTG 


pro 
CCC 


lys 
AAG 


arg 
AGG 


pro 
CCC 


arg 
AGG 


240 
arg 
CGT 


gly 
GGC 


ala 
GCT 


ala 
GCC 


pro 
CCT 


glu 
GAG 


pro 
CCG 


glu 
GAG 


arg 
CGG 


thr 
ACG 


250 
pro 
CCC 


val 
GTT 


gly 
GGG 


gin 
CAG 


gly 
GGG 


ser 
TCC 


trp 
TGG 


ala 
GCC 


his 
CAC 


pro 
CCG 


260 
gly 
GGC 


arg 
AGG 


thr 
ACG 


arg 
CGT 


giy 

GGA 


pro 
CCG 


ser 
AGT 


asp 
GAC 


arg 
CGT 


gly 
GGT 


270 
phe 
TTC 


cys 
TGT 


val 
GTG 

gly 
GGT 


val 
GTG 

ala 

GCG 


ser 
TCA 

leu 
CTC 


pro 
CCT 
290 
ser 
TCT 


ala 
GCC 

gly 
GGC 


arg 
AGA 

thr 
ACG 


pro 
CCC 

arg 
CGC 


ala 
GCC 

his 
CAC 


280 
glu 
GAA 

ser 
TCC 


glu 
GAA 

his 
CAC 


ala 
GCC 

pro 
CCA 


thr 
ACC 

ser 
TCC 


ser 
TCT 

val 
GTG 


leu 
TTG 
300 
gly 
GGC 


glu 
GAG 

arg 
CGC 


gin 
CAG 


his 
CAC 


his 
CAC 


ala 
GCG 


gly 
GGC 


pro 
CCC 


pro 
CCA 


ser 
TCC 


310 
thr 
ACA 


ser 
TCG 


arg 
CGG 


pro 
CCA 


pro 
CCA 


arg 
CGT 


pro 
CCC 


trp 
TGG 


asp 
GAC 


thr 
ACG 


320 
pro 
CCT 


cys 
TGT 


pro 
CCC 


pro 
CCG 


val 
GTG 


tyr 
TAC 


ala 
GCC 


glu 

GAG 


thr 
ACC 


lys 
AAG 


330 
his 
CAC 


phe 
TTC 


leu 
CTC 


tyr 
TAC 


ser 
TCC 


ser 
TCA 


gly 
GGC 


asp 
GAC 


lys 
AAG 


glu 

GAG 


340 
gin 
CAG 


leu 
CTG 


arg 
CGG 


pro 
CCC 


ser 
TCC 


phe 
TTC 


leu 
CTA 


leu 
CTC 


ser 
AGC 


ser 
TCT 


350 
leu 
CTG 


arg 
AGG 


pro 
CCC 


ser 
AGC 


leu 
CTG 


thr 
ACT 


gly 
GGC 


ala 
GCT 


arg 
CGG 


arg 
AGG 


360 
leu 
CTC 


val 
GTG 


glu 
GAG 


thr 
ACC 


ile 
ATC 


phe 
TTT 


leu 
CTG 


gly 
GGT 


ser 
TCC 


arg 
AGG 


370 
pro 
CCC 


trp 
TGG 


met 
ATG 


pro 
CCA 


gly 
GGG 


thr 
ACT 


pro 
CCC 


arg 
CGC 


arg 
AGG 


leu 
TTG 


380 
pro 
CCC 


arg 
CGC 


leu 
CTG 


pro 
CCC 


gin 
CAG 


arg 
CGC 


tyr 
TAC 


trp 
TGG 


gin 
CAA 


met 
ATG 


390 
arg 
CGG 


pro 
CCC 


leu 
CTG 


phe 
TTT 


leu 
CTG 


glu 
GAG 


leu 
CTG 


leu 
CTT 


gly 
GGG 


asn 
AAC 


400 
his 
CAC 


ala 
GCG 


gin 
CAG 


cys 
TGC 


pro 
CCC 


tyr 
TAC 


gly 
GGG 


val 
GTG 


leu 
CTC 


leu 
CTC 


410 
lys 
AAG 


thr 
ACG 


his 
CAC 


cys 
TGC 


pro 
CCG 


leu 
CTG 


arg 
CGA 


ala 
GCT 


ala 
GCG 


val 
GTC 


420 
thr 
ACC 


pro 
CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gin his 
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



arg 
CGG 


glu 

GAG 


lys 
AAG 


thr 
ACA 


asp 
GAC 


pro 
CCC 


pro 
CCC 


trp 
TGG 


460 
gin 
CAG 


val 
GTG 


pro 
CCC 


pro 
CCA 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


leu 
CTC 


ser 
TCG 


leu 
CTG 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


glu 
GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


met 
ATG 


550 
ser 
AGT 


val 
GTC 


thr 
ACG 


glu 
GAG 


lys 
AAG 


ser 
AGT 


580 
val 
GTC 


leu 
TTG 

gin 

CAG 


lys 
AAG 

his 
CAT 


arg 
AGG 
610 
arg 
CGG 


phe 
TTC 


ile 
ATC 


pro 
CCC 


tyr 
TAC 


val 
GTC 


640 
val 
GTG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 

glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 

arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 

gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 
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650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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leu val thr pro his 
TTG GTG ACA CCT CAC 

890 

leu val arg gly val 
CTG GTC CGA GGT GTC 



lys thr val val asn 
AAG ACA GTG GTG AAC 

920 

thr ala phe val gin 
ACG GCT TTT GTT CAG 

gly leu leu leu asp 
GGC CTG CTG CTG GAT 

950 

ser ser tyr ala arg 
TCC AGC TAT GCC CGG 



arg gly phe lys ala 
CGC GGC TTC AAG GCT 

980 

val leu arg leu lys 
GTC TTG CGG CTG AAG 



asn ser leu gin thr 
AAC AGC CTC CAG ACG 

1010 

leu gin ala tyr arg 
CTG CAG GCG TAC AGG 



his gin gin val trp 
CAT CAG CAA GTT TGG 

1040 

ser asp thr ala ser 
TCT GAC ACG GCC TCC 



ala gly met ser leu 
GCA GGG ATG TCG CTG 

1070 

ser glu ala val gin 
TCC GAG GCC GTG CAG 



leu thr arg his arg 
CTG ACT CGA CAC CGT 



880 



leu 
CTC 


thr 
ACC 


his 
CAC 


ala 
GCG 


lys 
AAA 


pro 
CCT 


glu 
GAG 


tyr 
TAT 


giy 

GGC 


cys 
TGC 


phe 
TTC 


pro 
CCT 


val 
GTA 


910 
glu 
GAA 


asp 
GAC 


met 
ATG 

thr 
ACC 


pro 
CCG 

arg 
CGG 


ala 

GCC 

thr 
ACC 


his 
CAC 
940 
leu 
CTG 


giy 

GGC 

glu 
GAG 


thr 
ACC 


ser 
TCC 


ile 
ATC 


arg 
AGA 


ala 
GCC 


gly 
GGG 


arg 
AGG 


asn 
AAC 


970 
met 
ATG 


arg 
CGT 


cys 
TGT 


his 
CAC 


ser 
AGC 


leu 
CTG 


phe 
TTT 


val 
GTG 


cys 
TGC 


thr 
ACC 


1000 
asn ile 
AAC ATC 


phe 
TTT 


his 
CAC 


ala 
GCA 


cys 
TGT 


val 
GTG 


lys 
AAG 


asn 
AAC 


pro 
CCC 


1030 
thr phe 
ACA TTT 


leu 
CTC 


cys 
TGC 


tyr 
TAC 


ser 
TCC 


ile 
ATC 


gly 
GGG 


ala 
GCC 


lys 
AAG 


1060 
gly ala 
GGC GCC 


trp 
TGG 


leu 
CTG 


cys 
TGC 


his 
CAC 


gin 
CAA 


val 
GTC 


thr 
ACC 


tyr 
TAC 


1090 
val pro 
GTG CCA 



thr phe leu arg thr 
ACC TTC CTC AGG ACC 

900 

val val asn leu arg 
GTG GTG AAC TTG CGG 



glu ala leu gly gly 
GAG GCC CTG GGT GGC 

930 

leu phe pro trp cys 
CTA TTC CCC TGG TGC 

val gin ser asp tyr 
GTG CAG AGC GAC TAC 

960 

ser leu thr phe asn 
AGT CTC ACC TTC AAC 



arg lys leu phe gly 
CGC AAA CTC TTT GGG 

990 

leu asp leu gin val 
CTG GAT TTG CAG GTG 



tyr lys ile leu leu 
TAC AAG ATC CTC CTG 

1020 

leu gin leu pro phe 
CTG CAG CTC CCA TTT 



phe leu arg val ile 
TTC CTG CGC GTC ATC 

1050 

leu lys ala lys asn 
CTG AAA GCC AAG AAC 



ala gly pro leu pro 
GCC GGC CCT CTG CCC 

1080 

ala phe leu leu lys 
GCA TTC CTG CTC AAG 



leu leu gly ser leu 
CTC CTG GGG TCA CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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